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Abstract 

We significantly extend recently developed methods to faithfully reconstruct 
unknown quantum states that are approximately low-rank, using only a few mea- 
surement settings. Our new method is general enough to allow for measurements 
from a continuous family, and is also applicable to continuous-variable states. As a 
technical result, this work generalizes quantum compressed sensing to the situation 
where the measured observables are taken from a so-called tight frame (rather than 
an orthonormal basis) — hence covering most realistic measurement scenarios. As 
an application, we discuss the reconstruction of quantum states of light from homo- 
dyne detection and other types of measurements, and we present simulations that 
show the advantage of the proposed compressed sensing technique over present 
methods. Finally, we introduce a method to construct a certificate which guaran- 
tees the success of the reconstruction with no assumption on the state, and we show 
how slightly more measurements give rise to "universal" state reconstruction that 
is highly robust to noise. 

1 Introduction 

One of the most fundamental tasks in quantum mechanics is that of quantum state 
tomography, i.e., reliably reconstructing an unknown quantum state from measure- 
ments. Specifically in the context of quantum information processing in most experi- 
ments one has to eventually show what state had actually been prepared. Yet, surpris- 
ingly little attention has so far been devoted to the observation that standard methods 
of quantum state tomography scale very badly with the system size. Only quite re- 
cently, novel more efficient methods have been introduced which solve this problem 
in a more favorable way in the number of measurement settings that need to be per- 
formed H] 12] [3j |4] |5] IS [T2]| . This development is more timely than ever, given that 
the experimental progress with controlled quantum systems such as trapped ions is so 
rapid that traditional methods of state reconstruction will fail: E.g., 14 ions can already 
be controlled in their quantum state |7|. Hence, further experimental progress appears 
severely challenged as long as ideas of reconstruction cannot keep up. Such new meth- 
ods are based on ideas of quantum compressed sensing |[T]|2l|6| — inspired by recent 
work on low -rank matrix completion mm — or on ideas of approximating unknown 
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quantum states with matrix-product states f4\. Indeed, using methods of quantum com- 
pressed sensing, one can reduce the number of measurement settings from — 1 in 
standard methods to 0{rn log^ n) for a quantum system with Hilbert space dimension 
n, if the state is of rank r. This is efficient in the sense that the number of measurements 
required is only slightly greater (by an 0(log^ n) factor) than the number of degrees of 
freedom in the unknown state. 

These ideas have so far been tailored to the situation where observables are taken 
from an orthonormal operator basis, which is not always the natural situation at hand. 
In this paper, we introduce a theory of state reconstruction based on quantum com- 
pressed sensing that allows for continuous families of measurements, referred to as 
tight frames, which can be thought of over-complete, non-orthogonal generalization of 
operator bases. These settings are particularly important in the context of continuous- 
variables, which are notably used to describe quantum optical systems beyond the 
single-photon regime. These have drawn a considerable amount of research, both ex- 
perimentally and theoretically, due to very desired features such as easy preparation 
and highly efficient detection. Note that when talking about a measurement, we always 
mean the estimation of an expectation value of an observable for which, of course, 
several repetition of some experimental procedure are necessary. In this paper we are 
mainly concerned with the number of distinct observables or measurement settings that 
are needed for tomography}^ 

In this work, we make significant progress towards a full theory of efficient state 
reconstruction via compressed sensing: 

1 . We introduce new incoherence properties for tight frames, that are sufficient to 
ensure efficient compressed sensing for low-rank states. This uses an extension 
of the "golfing" proof technique of |[T1 |2l. We give examples of tight frames 
that satisfy these properties. In addition, we show that, if one only wishes to 
reconstruct "typical" or "generic" low-rank states, there is a much larger class of 
tight frames that also lead to efficient compressed sensing. 

2. We also describe a way to certify a successful reconstruction of the state, making 
our protocol unconditional and heralded. In this way, one does not need to make 
any a priori assumptions on the unknown state. Our method uses convex duality, 
and is different from other approaches to certification that focus mainly on pure 
states im m [TT] [T2l . Also, we discuss the robustness of the procedure under 
decoherence, imperfect measurements, and statistical noise. We show that, as 
long as all those effects are small, it is possible to certify that the reconstructed 
state is close to the true state. 

3. We show that, using an incoherent tight frame, and a slightly larger number of 
measurements, one can achieve universal state reconstruction: a single fixed set 
of measurements can simultaneously distinguish among all possible low-rank 
states. This is a qualitatively stronger claim than those shown above, and it is 
obtained using a different technique, based on the "restricted isometry property" 
(RIP) 16jl13J. This implies strong error bounds, showing that our procedure for 
state reconstruction is robust to statistical noise, and that it works even when 
the true state is full-rank with rapidly decaying eigenvalues (in which case our 
procedure returns a low-rank approximation to the true state)]^ 



'Other work addresses the number of copies of the unknown state that must be provided [10] — that is, 
the sample complexity of tomography. 

^As a side note, the RIP-based analysis also shows that the compressed sensing state estimator is nearly 
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4. We show how our theory can be appHed in reaHstic experimental scenarios, in- 
volving pointwise measurements of the Wigner function, and homodyne detec- 
tion. 



5. We demonstrate numerically that compressed sensing outperforms the naive ap- 
proach to tomography not only in the asymptotic limit of large systems but also 
for system sizes commonly accessible in present day experiments. 

This article is organized as follows: We start by introducing quantum compressed 
sensing in the general setting described by tight frames in Section 2. After discussing a 
suitable notion of efficiency, we show in Section 3 that efficient compressed sensing is 
possible if the tight frame fulfills certain incoherence properties. Section 4 is devoted to 
certified compressed sensing. We discuss how to certify the success of the reconstruc- 
tion without prior assumptions on the tight frame, both in the ideal case and under the 
effects of errors. In Section 5 we show universal state reconstruction and error bounds. 
In Section 6, we investigate applications of the formalism to two common classes of 
quantum optical experiments; and in Section 7, numerical data, showing the efficiency 
for small systems, is presented. 

2 Quantum compressed sensing 

Consider a quantum system with Hilbert space dimension n. In most cases of interest, 
n is very large, but the states one wants to reconstruct are approximately low-rank, that 
is, they are well-approximated by density matrices having rank r <C n. (Pure states 
correspond to the case where r = 1.) When dealing with continuous-variable systems, 
we will truncate the infinite-dimensional Hilbert space and choose n to be some large 
but finite cutoff. This is unavoidable, if one wants to do tomography as one cannot 
reconstruct a state that contains an infinite number of completely independent param- 
eters. However, in most experimentally relevant situations, e.g., continuous-variable 
fight modes with finite mean energy, all states can be arbitrarily well approximated by 
finite-dimensional ones. We will elaborate on this claim when discussing other sources 
of errors such as decoherence or imperfect measurements. 

Compressed sensing contains two key ideas. First, rather than measuring all de- 
grees of freedom, it is sufficient to measure a randomly chosen subset of about rn de- 
grees of freedom, provided these degrees of freedom satisfy certain incoherence prop- 
erties. Secondly, one can reconstruct the state using an efficient algorithm. The obvious 
approach of searching for the lowest-rank state compatible with the measurement re- 
sults leads to a computationally intractable problem (generally NP-hard). Instead, one 
can perform a convex relaxation, and minimize ||.||i instead of the rank. Here ||.||p 
stands for the Schatten p-norm: ||.||i, ||.||2, and ||.|| = ||-||oo are respectively the trace 
norm, Frobenius norm, and spectral (or operator) norm. 

Let us denote the m measured observables, i.e. Hermitian matrices, by wi , . . . , Wm, 
and suppose that we estimate their expectation values (by measuring many copies of 
the unknown state). Knowing these expectation values (for an unknown state p) is 
equivalent to knowing the value of the sampling operator Tl{p), where we define 




(1) 



i=l 



minimax-optimal 1131 , and it implies nearly-optimal bounds on the sample complexity of low-rank quantum 
state tomography 1101 . 
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where {A, B) ~ Tr{A'^ B) is the Hilbert-Schmidt scalar product. In all of our com- 
pressed sensing schemes, wi , . . . , Wm will be chosen independently at random from 
some distribution ^. The sampling operator 7?. is a linear super-operator on the dP- 
dimensional real vector space of Hermitian matrices, or operators, S((D"). Such super- 
operators will always be denoted by capital script letters. Sometimes we will use the 
notation TZa, multiplying the "matrix" TZ by the "vector" cr; this means the same thing 
as 7?.(cr). 

Let p be the unknown state. In the ideal case, with perfect measurements and no 
statistical noise, we measure Tl{p) exactly. Then the procedure to reconstruct p can be 
written as 

min ||cr||i, subject to 7^(ct) = 7^(/9). (2) 

crGB(C") 

Note that a quantum state p is a Hermitian matrix with the additional properties p > 
and Trp = 1. However, the reconstruction procedure (j2]) does not make use of this 
property and is, therefore, also applicable in more general settings, e.g. matrix com- 
pletion. This problem can be stated as a semi-definite program (SDP) and, therefore, 
solved efficiently with many well-developed tools. 

In the case of noisy data, we measure Tl{p) approximately, that is, we measure 
some b such that \\h — 'R.{p)\\ < 6, for some norm || • || and tolerance S that are chosen 
depending on the kind of noise that is expected. The constraint TZa ~ TZp in (|2]i can 
then be replaced by ||7?.(a) — 6|| < S, which implies \\TZ{(j — p)|| < 2d. 

We remark that equation (|2| is the key to certifying our estimate for p. Notice that 
if the solution a* of (j2| is unique and fulfills ||cr*||i = 1, then it must be the case that 
cr* = p. We will show later on how one can test the uniqueness of the solution a*, 
without assuming anything about p. (This can be adapted to work with noisy data, 
without assuming anything about the noise.) 



2.1 Measurements and tight frames 

When we talk about a compressed sensing scheme, we mean any protocol based on 
the reconstruction procedure Q, with some choice of measurements described by the 
sampling operator ([TJ. In Refs. ljj|2J, the observables were required to be chosen uni- 
formly at random from an operator basis. We substantially generalize these techniques, 
using the notion of a tight frame, which naturally captures many useful quantum mea- 
surements: 

Definition 1 (Tight frame). Let p be a probability measure on some set S, and for 
every a G S, let Wa be an observable, i.e., a Hermitian operator, and let Va be the 
(unnormalized) orthogonal projector which acts as Va ■ c ^ {wa, cr)wa. We say that 
{wa)aeS is o tight frame if 

J VMa) = (3) 

This can also be written as ]E„(ri^7'Q,) = 1 where a is drawn according to p. 
Because we deal with randomly drawn operators very often, a will usually denote a 
random element of S that has distribution p. Note that we do not require that || Wq||2 — 
1 for all a as it will be convenient in many applications. However, we do require a 
weaker normalization condition: ]Ea[||wc,||2] = 1 which follows by taking the trace of 

We also define a generalized notion of a tight frame, where the sampling operator is 
not a sum of projectors; we will need this later to model homodyne detection on optical 
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modes, where a single measurement setting provides more information than only one 
expectation value. 

Definition 2 (Generalized tight frame). Let fi be a probability measure on some set 
S, and for every a ^ S let Qa be a positive operator We say that {Qa)aeS forms a 
generalized tight frame if 

j QadAi(«) = ^. (4) 

We note that the formalism can be also applied to 8-port homodyne detection which 
corresponds, for a single mode, to projections on coherent states |a) with a e (D. 



2.2 Uniqueness of the solution to (|2|) 

For p to be the unique solution to Q, any deviation A must be either trace-norm 
increasing, i.e., \\p + A||i > ||p||i, or infeasible, i.e., 7?.A ^ 0. This is done by 
decomposing A into a sum Ay + A;^, and then showing that, with high probability, in 
the case where At is large, A must be infeasible, while in the case where At is small, 
A must be trace-norm increasing. Here, we denote by T the real space of Hermitian 
matrices that send the kernel of p on its image. In other words, the elements of T are 
the Hermitian matrices a whose restriction on and to the kernel of p, i.e. tto-tt where tt 
is the orthogonal projection on Ker p, is equal to 0. Vt denotes the projection on this 
space T. 

Again, in the actual reconstruction, no assumptions have to be made concerning p 
or T. Theorem [T] gives a sufficient condition for uniqueness. The sign function sgn 
of a Hermitian matrix is defined by applying the ordinary sign function to the matrix' 
eigenvalues. 

Tlieorem 1 (Uniqueness of the solution). LetY e langeTZ, and set (a) ci :— \\VtY — 
sgnplla, (b) C2 \\V^Y\\, and(c) C3 := llVrnVT - Vt\\- If 

^{l-C2)J^—^-ci>0, (5) 

then the solution to Q is unique. 

Proof: A must be infeasible if ||7?.A|| > which is the case if 

WtzAtWI = (7^AT,7^AT) > \\nAj^\\l (6) 

The right-hand side is bounded as ||7^A^||| < ||7^||2|| A^||| < n^HA^Hl while the 
left-hand side fulfills 

||7^AT|l2 =(7^AT,7^AT) > —(At^TZAt) 

m 

9 

>—{l-\\VTnVT-VT\\)\\AT\\l. (7) 

m 

Thus, (|6]l is satisfied if 

— (1 - WTtHVt - VtW) \\At\\1 > n«||A^||2, (8) 
m 
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which, using definition (c), is equivalent to 



||A^||2<^||At||2a/^^- (9) 
Using the pinching lfT9l and Holder's inequalities, as detailed in Ref. yields 

||p + A||i > ||p||i + (sgnp + sgnA^,A). (10) 
The second term is equal to 

(sgn p^Y, At) + (sgn A^ -Y,A^) (1 1) 

which is, according to (a) and (b), larger than 

||A^||2-c2||A^||2-ci||At||2. (12) 

Inserting this into (j9]l gives rise to condition (j5]l and concludes the proof. If all the 
elements in the tight frame fulfill 1 1 Wa 1 1 2 = 1 we call it normalized and one can bound 
||7?.|| < n^. In this case (jsjl in Theorem[r|can be weakened to 



-(l-C2)W^^-ci >0. (13) 
n \ m 



2.3 Efficient quantum compressed sensing 

Let p be a state of dimension n and rank r. In the compressed sensing method of 
tomography, we choose m observables wi, . . . , Wm randomly from some distribution, 
measure their expectation values with respect to p, then solve (j2]) to obtain a* , which 
is our estimate of p. 

For a given state p, there is some probability Pf{p) that the procedure may fail 
(i.e., it may return a solution a* that is not close to p). Note that this probability p/(p) 
is taken with respect to the random choice of Wi, . . . , Wm, and the random outcomes 
of the measurements. We say that the method succeeds with high probability if, for 
every low-rank state p, the failure probability ispf{p) small. Equivalently, the method 
succeeds with high probability if, 

for every low-rank state p, most choices of the observables Wi, . . . , Wm 
can be used to successfully reconstruct p. 

Now, the basic question is: how large does m have to be, to ensure that the method 
succeeds with high probability? A common situation is that the system under consider- 
ation consists of k subsystems with local Hilbert space dimension d; then n — . Of 
course, no method of tomography can counter the exponential growth of the required 
number of measurements in k. Thus, efficiency needs to be regarded relative to the 
■n? — 1 measurements necessary for standard tomography. As even a pure state needs 
0(n) parameters to be described, this also is a lower bound to the number of observ- 
ables that need to be measured. We allow for an additional polylogarithmic overhead 
and define efficiency as follows: 

Definition 3 (Efficient quantum compressed sensing). Compressed sensing for a state 
p (with dimension n and rank r) is regarded as efficient if: The number of measured 
observables satisfies m = 0(nr polylog(n)), and the probability of failure satisfies 
Pfip) < 1/2. 
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If this is the case, pf{p) can be made arbitrarily small by repeating the protocol and 
using a majority vote among the reconstructed states to get the final result. Then, the 
probability of failure decays exponentially in the number of repetitions. 

Note that this is a very stringent definition of efficiency. One can also merely ask 
for any scaling of m in o(n^). Of course, this weaker condition is easier to satisfy, as 
we shall see later on. 



2.4 Sufficient conditions for efficiency 

The general theory of quantum compressed sensing, which will be developed here, re- 
Ues heavily on and significantly extends the analysis for the special case where the ob- 
servables form an operator basis in Ref. [2]. The hypothesis for Theorem[TJis fulfilled 
if ci < l/(2n*), C2 < 1/2, C3 < 1/2 under the additional condition m < n"/2, which 
can be safely assumed to be true as we are only interested in the regime of m ^ n^. 
For normalized tight frames, the first condition can be weakened to ci < l/(2n^). We 
show conditions to the tight frame under which those conditions are fulfilled with high 
probability. 

For efficient compressed sensing to be possible, the observables Wa need to fulfill 
certain incoherence properties. Roughly speaking, the observables are "incoherent" 
if they have small inner product with every possible state one wishes to reconstruct. 
For example, operator norm can be a measure of incoherence for reconstructing pure 
states, since WwaW = niax^^|^)^i('0|it;ct|-0). We distinguish two general cases (which 
we will define more precisely in the following sections): 

1. "Fourier-type" compressed sensing, where almost all of the observables have 
small operator norm. In this case, efficient compressed sensing is possible for 
any low-rank state. 

2. "Non-Fourier type" compressed sensing, where the observables may have large 
operator norm, but efficient compressed sensing is still possible for certain re- 
stricted classes of states, e.g., generic states. 



2.5 Fourier-type efficient compressed sensing 

The efficiency of a tomography protocol, as given in Definition [3] is a statement about 
a family of procedures acting on systems with growing dimension n. We now give 
a sufficient condition for a family of tight frames to allow for efficient compressed 
sensing. 

Theorem 2 (Fourier type). Let {wa{n))a£S be, for anyn > 0, a tight frame. Let p{n) 
be any state with dimension n and rank r. Let v — 0(polylog(ri)). Set C{n) := {a G 
S : ||wQ(n)|p > v/n} and let p{C{n)) be the measure of this set. If 

t^iCin)) < ^ ^ , (14) 

efficient compressed sensing is possible for the family of states p. 

Here, the underlying "incoherence property" is the bound on the operator norm of 
the observables, 

\\wM?<^h, (15) 
which holds for "most" choices of a. If there is no risk of confusion, we will omit the 
explicit dependencies on n. 



7 



2.5.1 Perfect Fourier- type case 

We have to first consider the case /i(C) = 0. Even though the proof in Ref. Q can 
be applied with only minor changes, we state it in a way as complete and still non- 
technical as possible where we focus on the asymptotic behavior and do not provide 
explicit constants. We need Lemma 5 from Ref. Q which reads: 

Lemma 1 (Large deviation bound for the projected sampling operator). For all t < 2 

P [WVt'R-T't - VtW >t]< 4nrexp ( -^-^^ , (16) 



where k — m/{nr) is the oversampling factor which must fulfill k — 0(polylog(n)) 
for efficiency. 

The tool to prove Lemma [T| and other bounds of this form is provided by the 
operator-Bernstein inequality which was first given in Ref |17| and which we state 
here without a proof. 

Lemma 2 (Operator-Bernstein inequality). Let (Xi)i=i,...,T?x be i.i.d. Hermitian matrix- 
valued random variables with zero mean. Suppose there exist constants Vo <^nd c such 
that \\Wi(Xf)\\ < Vq, \\Xi\\ < c where the latter needs to be true for all realizations of 
the random variable. Define A = Xi and V — mV^. Then, for all t < 2V/c 

P[|1A|| >t] <2nexp(^-^^ . (17) 

The proof of Lemma [T| is given in Ref. ^ but we restate it here because it is 
quite instructive. Let a be a random variable taking values in S. We define m ran- 
dom variables by Z^. = {n'^ /m)VT'Pai'PT and X^. = Z^, - ]E(^aJ- Now S ^ 
VtT^'Pt ^ 'Pt ~ J2i -^cti and we have to estimate the maximum of WX^^ \\ and the 
norm of the variance of X^^ in order to apply Lemma|2j From the incoherence condi- 



tion (15 I, we get by using the matrix Holder inequality |[I9 



WVrwJi^ sup iwc,,ay <2iy-. (18) 

ctGT,||o-||2 = 1 " 



This allows us to write 



2nvr — 1 ,, ,, 2v 

< ^\\Vt\\ < — (19) 

m'' rriK 



and 



l^aJI ^-\\n^VTVo.,rT-VT\ 

m 



2 



<-\\n'VTVc.^VT\\ - —\\VTWo.^\l 

m m 
2v 

< — ■ (20) 

K 

Here, and in the remainder, statements of the form ( |20] l are meant to hold for all real- 
ization of the random variable as needed in the Operator Bernstein inequality. Inserting 
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now ( [T9| a nd ( 20 1 into Lemma|2]yields Lemma[T]which concludes the proof. Applying 
Lemma 111 for t = 1/2 and choosing n = 0(polylog(ri)), the probability that C3 > 1/2 
can be made arbitrarily small. 

Now we have to construct a certificate Y whose projection on T is close to sgn p. 
This is done by an iterative process, called the golfing scheme [2 |. The m samples are 
grouped into I groups which are indexed by i and contain nii samples each. Let TZi be 
the sampUng operator of the ith group and set Xq = sgn p, Xi ~ (1 —VTT^i'PT)Xi-i, 

= E}=i7^,^j-i,andy = Fi. 

Again, Lemma [T] can be used to show that with high probability (at the expense of 
a polylog growth of Ki) 

\\x,\\2 < \\VTn,rT-VT\\\\x,-i\\2 < ^ll^^-ilb, (21) 

and, therefore, |lXi||2 < \/r2^* from which we get 

ci = \\Xi\\2<V^2-' (22) 

while for the final inequality to hold it is enough to set I = Q{\og{y/rn)). For the last 
remaining condition we need the subsequent statement: 

2 



Lemma 3 (Bound for the orthogonal projection). Let F £ T and t < ^2/r\\F\\2- 
Then 

P WW^TZFW >t\< 2nexp (-^^) ■ (23) 



Proof: Without loss of generality, consider ||F||2 = 1 and define the zero-mean 
random variables X^^ = (n^/m)7'^Wc(. (wq. , F) which fulfill X^i = Vj^TZF. 
Their variance is bounded by 

mxi,)\\ <4 / A{t,){w^,Ff\\{v^w^n 

TO^ J 

< — , (24) 
rriKr 

and their norm by 



\\X.^\<'^J^^^. (25) 
Lemma [3] follows directly from using (|24]) and ( [25] l in Lemma|2] Now we can bound 

c2 = \\V^Y\\<\Y.2~^''-'^^ <\. (26) 
1=1 

Again, the probability of ( [26] l not being true can be made as small as desired by choos- 
ing K = 0(rTO polylog(n)). Of course, this is also true for the total probability of 
failure which concludes the proof. 
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2.5.2 Imperfect Fourier-type case 

We now show that the incoherence condition may be violated for some of the ob- 
servables and adapt a technique used in Ref. (14). Intuitively, when /i(C) is small 
enough, we can just abort and restart the reconstruction procedure whenever we en- 
counter a non-incoherent operator during our sampling process. The probability of this 



to happen is upper bounded by {16^/rn^)^^ as obtained from (14 1 by a union bound 
over the m measurements. This is equivalent to sampling only from the set S \ C. 
The conditional probability distribution on the observables does fulfill the approximate 
tight-frame condition 

IIW-III < l/(8yf), (27) 

where W — n?'E{'Pa\E) where E is the event that all of the m chosen operators satisfy 
\\wai IP < i^/n and its complement is denoted by E"^. Let be the indicator function 
ofE. Then, 1 = n'^'E{'Pa) = n'^'E{ValE) + n^ECPalijc). This leads to 

Wn^-EiVc^lE) - 1\\¥{E) = ||(1 -P(S))1 - n2E(7'„l£c)|| 

<¥{E')+n^\\MiVa.lE^)\\. (28) 
With the help of Jensen's inequality, we can simplify ||]E(7'q1£;c)|| < l£;c) — 



'P{E'^). Inserting this into (28 i and rearranging, we get 



Wn^nW) 111 < < 2n'¥{E^). (29) 

Our claim follows by taking P(£''^) — l/{16y/r) which is always true by a union 
bound. We now have to justify why the tight frame condition ([3]l can be replaced by 
the approximate one in Eq. ( 27 1 in the proof of Lemma[T]and Lemma[3] We denote the 



probability measure which is conditioned on the event E by /i. 
Lemma[T]provides a bound to 

\\VT{n-i)VT\\ < \\VT{n-w)VT\\ (30) 

+ \\Vt{W-1)Vt\\- 

We define the random variables Za and Xa as in the proof of Lemma [T] and bound 
their variance as 

< mzi)\\ + mzc.f\\ 

\ {2niyr+\\Wf) 



< 



If , \ Anvr 

= [2nvr+{-^ + lf)<—, (31) 

\ 8^/r J 

and their norm as WXa^ \\ < 2vnr/ra. Using the operator Bernstein inequality yields 

Lemma 4 (Large deviation bound for the projected sampling operator). 

V{\\rT'R-VT - ^tII >t)< 4nrexp f-^) , (32) 



Mv J ' 



for all 1/(47?) < i < 4. 
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where we have also used (pOl to bound the second term in (27i. Thus, up to an 
irrelevant constant factor, Lemma|4]replaces Lemma [T] wherever it is used. 
To also replace Lemma[3] let F e T, ||F||2 = 1 and note that 



IVt^FW < \\Vt{'R--W)F\\ 



The random variables are Z^- 
the variance is bounded by 



{n'^/m)V^Va,F and = 



(33) 

E(Z„J where 



< 



< 



mzi)\\ + mza.n 



1 

64^ 



< 



2i/ 



(34) 



which gives, together with || < 2\/2v j (y/rn), and an application of the operator- 
Bernstein inequality the subsequent statement. 

Lemma 5 (Bound for the orthogonal projection). LetF e T andl/ {2y/r) < t/\\F\\2 < 
l^flfr. Then 

P[I1P^F|1 >i] <2nexp(^-^^^^). (35) 

Lemma |5] takes the place of Lemma |3] and, again, differs only by a constant factor 
in the exponent which concludes the proof of Theorem|2] 

An example for a Fourier-type frame for which /i(C) 7^ is given by the following 
situation. Here, with some probability, every Hermitian matrix with unit Frobenius 
norm is drawn in the measurement. 

Example 1 (Tight frame containing all Hermitian matrices). Any G i3((D") with 
\\wa\\2 = 1 can be viewed as a vector on the dimensional unit sphere. Therefore, 
on can define a rotationally invariant Haar measure on it. The tight frame formed 
by the Haar measure on all Hermitian matrices with ||wq||2 = 1 fulfills Theorem^ 
Therefore, it allows for efficient compressed sensing. 



In order to satisfy Theorem|2] we have to show 



P 



>-]< 



(36) 



where v ^ O (poly log [n)). To see that this is true, we note that we are dealing with 
a normalized version of the extensively discussed Gaussian unitary ensemble (GUE) 
denoted by {wa}, — Wa/\\wa\\2- Now for all (5 > 0, e > we have 



P 



> 



< P 



> 



6e 



P(||Wa||2 > e) 



The first term can be bounded using a result from Ref. 1161 yielding 

P(||iD„|| > fc/Vn) <ciexp(-C2n(fc- 2)3/2) 



(37) 



(38) 



where Ci , C2 > are small constants while for the second term we use the properties 
of the Xfe-distribution which are given the appendix. From this, we get 



Pdlw'alla > <exp(V"V4) 



(39) 
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We set y = 1/2 and see that (36 1 is fulfilled for some constant v when n is large 
enough. 

Product measurements are of great experimental importance: They describe the 
situation of addressing individual quantum systems, say, ions in an ion trap experiment 
or individual modes in an optical one. They are described by tight frames which are 
formed as tensor products of tight frames on the local systems. Given a tight frame 
which fulfills < v jd, one can obtain a tight frame on the n = d'' dimensional 

Hilbert space by forming the /c-fold tensor product. The strongest possible incoherence 
property we can obtain is HwqIP < jn. Unless = 1, as for the Pauli matrices, v 
grows too fast to allow for efficient compressed sensing for all states. This is even true if 
the incoherence condition may be violated on some set C with /i(C) = 0(l/poly(n)). 

2.6 Non-Fourier-type efficient compressed sensing 

The conditions in Theorem |2] imply that efficient compressed sensing is possible for 
any low -rank state p. This is a quite special situation and for Theorem |2] to be ful- 
filled, either a very special structure, like the one of the Pauli basis or a large 
amount of randomness, like in the above example, is needed. As an example for a very 
different situation, consider the state p = |0)(0| together with the observables which 
corresponds to the sampling of single matrix-entries (or the Hermitian combinations 
of two of them). Here, one needs to take Q{r?) attempts until one "hits" the single 
non-zero entry in the upper-left corner. This is not surprising because the operators in 
this basis fulfill = ©(I)- However, for most of the states, efficient compressed 

sensing is indeed possible in this basis. In Theorem [3] we give a sufficient condition 
for combinations of states and tight frames to work. 

Theorem 3 (Non-Fourier-type efficient compressed sensing). For a given tight frame 
\wa I a G S\, and a given rank-r state p, denote by C <Z S the set of observables for 
which at least one of the following conditions is not fulfilled: 

2i/r 

WVrWaWl < , (40) 

n 

(u'a,sgnp)2 <^^. (41) 

tf l^-iC) < (16v^7i^m)^^, efficient compressed sensing is possible for the state p. 

The golfing scheme works exactly like in the Fourier-type case, as does the proof 
of Lemma [T] However, Lemma |5] must be replaced by something else. Again, we 
use the technique of conditioning which means that we assume the incoherence con- 
dition to hold for all operators in the tight frame and the tight frame condition to be 



approximately true as in ( 27 1. First, we need some preparation. 



Lemma 6 (Bound to the scalar product). Let F £ T such that \\F\\2 < f, l/(4-yr) < 
f/t < l^fljr, and 

iwa,Ff<^ (42) 

for all a €z S. Then 

P (r^7^F|| >t)< 2nexp (-^) ■ (43) 
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Proof: We consider the same same random variables as in the proof of Lemma |4] 
(note that we have again set ||F||2 = 1) and bound their variance as 



/ f 1 

mxlM j {wo..Ff{^\wl\i;) + - 



64r) 



<- 
mnr ' 

where we have used the incoherence property and 

djl{a) wl 

To see that (|45b holds, we start with 



2 

< . 

n 



(44) 



(45) 



^ = y dM(a) wl = il- \C\) J dfi{a) <+ dM(a) wl (46) 

where the first equality follows directly from the tight frame property, c.f. Ref |2|, 
while the second one stems from the definition of the conditional probability distribu- 
tion Jl. Rearranging and taking the norm yields 



< 



1 



(47) 



which implies (|45| ). Using (44 1 together with WXa^ \\ < 2\^u/ i^/rn) in Lemma|2j we 
obtain Lemma 16] which concludes the proof. 

The above Lemma must by applied for F = X^, . . . , Xi, i.e., the operators occur- 
ring in the golfing scheme. By the second incoherence condition, (42i is fulfilled for 
F = Xq. To ensure that incoherence is preserved during the golfing scheme, we must 
use a more complicated and technical argument than in Ref. |2 | where a union bound 
over all elements of the operator basis was used which is clearly impossible in a tight 
frame with an infinite number of elements. 



Lemma 7 (Replacing the union bound). 

where ^ (F) is the smallest number such that 



M£,{F)v J ' 



(48) 



16-\/r7^ 



Proof: We fix an element wp from the tight frame and note that for F e T 

\{wp,VT{n-i)F)\ <\{wp,VT{n^yv)F)\ 

+ \{w0,Vt{W-1)F)\. 



(49) 



(50) 



The latter term is smaller than WW — 1||||F||2. To bound the former term, we define 
the random variable 



1 

Xa, ^ —{wp,F) - (Wp, VTWa,)iWa^,F) 

m m 



(51) 
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whose variance is bounded by 

\nxl] < ^^^^^ + AllW - IIPII^^II^ (52) 



and II^qJI < 2{l-\-nvr )^£XF)/m. Using once again the operator Bernstein inequal- 
ity yields after squaring 

P {{^,,{l-VTnVr)F? > Imij < 2exp [^^^^^^^ ] ■ (53) 



Eq. (53 I says that the desired property is true with high probability for any fixed WjS. 
To show that it is also true with high probability for most of the operators, we need a 
simple fact from probability theory, which is shown in the appendix. 

Lemma 8 (Inverting probabilities). Let X and Y be two measure spaces and denote 
by X ^ y a relation between the elements x ^ X and y Cz Y. If 

yx e X : ¥{x ^ y\y eY) <p (54) 

then 

lPOP{x^y\xeX)> p\yeY)<^ (55) 



Applying this to p3\ and using the definition of S,{F), one directly obtains ( |48)l 
which completes the proof of Lemma|7] Now, we can see that iJ.{Xi) < 2^'^y/ri//n? 
which means that Lemma |6] can be applied in the golfing scheme and we have proven 
Theorem [3] 



2.7 Reconstructing generic quantum states 

In a next step, we investigate the reconstruction of random quantum states, that are 
sampled from probability measures that are invariant under the action of the unitary 
group by conjugation. We show examples of tight frames that satisfy the incoherence 
properties required in Theorem|3]to allow reconstruction of most quantum states. 

Theorem 4 (Incoherence properties of generic states). Let {wa)a<£S be a (family of) 
tight frame for which all operators fulfill ||wq.||i = 0(polylog(n)), and pick a ran- 
dom rank r quantum state p, with a distribution that is invariant under the action of 
the unitary group. Then the probability that p cannot be efficiently reconstructed by 
compressed sensing vanishes as 0(l/poly (n)). 

Note that Theorem|4]holds for all unitarily invariant measures on the quantum states 
of rank r regardless of the actual distribution of the eigenvalues. 

Proof: We first show that for any fixed element of the tight frames, both incoherence 
properties are fulfilled with high probability. First, we turn to 

\\Ptw\\1^ \iU^^U\j\^ (56) 

i j'|inin(i j')<r 

where L'^ is a unitary matrix which is chosen according to the Haar measure and we 
have fixed an element w from the tight frame. We look at the ith row of WwU and 
note that J2j — We write wj for the jth column vector 
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of w and note that Wwj/\\wj \\ is just a random vector on a sphere . Thus, the squares 
of its coordinates are concentrated around 1/n, c.f. the appendix, and we get 



Using this in Eq. (56 1, inserting J2i ll^dP — \\M\2 ~ 1' ^^'^ applying ^ union bound 
yields 

1Pu(^\\Vtw\\1>^^ <2nrexp(-'^y (58) 
Employing again Lemma |8] this implies 

P. (p. (iWr^Wl > ^) > ^^-i^) < 32Wme.p (-0 . (59) 

where w is chosen according to the probability distribution of the tight frame. By al- 
lowing ly to grow polylogarithmically in n, this probability vanishes polynomially in n 
which means that it is violated to much only for a proportion of state vanishing poly- 
nomially as n grows. Now we turn to the second non-Fourier incoherence condition. 
Decomposing it; as a sum of projectors on orthogonal subspaces w — J^k '^fc I ^fc) {'^k\, 
we can write 

r 

|Ksgnp)|<^^|Afe||(*|C/t|vI/,)|2. (60) 

i=0 k 

Using the concentration of measure on the sphere and J^k l^fel ~ II^IU yields 

r v , 



n 



P ( {w,sgnpf > -^\\w\\l ] < 2?lr■e"^/^ (61) 

which finally gives 



P[/ I P^ I (u;,sgnp)2 > ) > ^ 



< 32r^/^mn-' exp I 1 . (62) 



Since the additional factor of r can be absorbed in i^, Theorem|4]follows from Eq. ( [62| . 

Tight frames for which this is the case include those where the rank of the operators 
does not grow with n. The other extreme is given by the Pauli basis: From \\w\\2 ~ 1 
and \\w\\ = 1/y/n it follows that ||w||i = y/n. Colloquially speaking, a small spectral 
norm implies a large trace norm and vice versa. Thus, we have two classes of tight 
frames (Fourier likes ones and the ones with small 1-norm) for which efficient com- 
pressed sensing is efficiently possible. Because they represent in some sense the two 
extreme cases (flat spectra vs. concentrated spectra), we have some reason to believe 
that this is indeed true for any tight frame. 



3 Certification 
3.1 Ideal case 

Theorems |2] and [3] show that efficient compressed sensing is possible in a vast number 
of situations. They are stated in the asymptotic regime for clarity but could be furnished 
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with reasonable prefactors for finite Hilbert-space dimension n. However, when using 
compressed sensing in actual experiments, one encounters three main problems. 

• Firstly, the necessary number of measurements as calculated from the incoher- 
ence properties of the employed tight frame might still be too large to be feasible. 

• Secondly, repetition of the experiments to increase the probability of success to 
a satisfactory value may be expensive or difficult. 

• Thirdly, it is unknown how close to low-rank the state actually is. After all, no 
assumptions are made about the unknown input state. 

The solutions to those problems is provided by certification. Instead of theoretically 
constructing some certificate based on p with the help of the golfing scheme, we use 
the solution of the minimization problem <t* to explicitly check whether the conditions 
for Theorem [U are satisfied for a* . The candidate for the certificate can be calculated 
as 

Y = nVT'iVT'nVT'y^ sgna* (63) 

where Vt' is obtained Uke Vt but with p replaced by a* and M^^ denotes the Moore- 
Penrose pseudo inverse of M. One can now check whether (j5]l is fulfilled. If the 
conditions for Theorem[T|are fulfilled and \\a* \\i = 1, the solution must be unique and 
equal to the state p, i.e. tomography was successful. 

3.2 Errors and noise 

For compressed sensing to work in a realistic setting, the reconstruction procedure must 
be robust, i.e., small errors introduced by decoherence, errors stemming from imper- 
fect measurements, and statistical noise due to the fact that every observable is only 
measured a finite number of times, should only lead to small errors in the reconstructed 
state. In addition, the Hilbert space might be infinite-dimensional. When the mean 
energy, and therefore, the mean photon number A'mean, is finite, the error made by 
truncating the Hilbert space at photon number TV vanishes as 

l!pt.u„c- Pill < 3^1^ = £ (64) 

which is shown in the appendix. This means that the expectation values with respect to 
the truncated state are close to the actually measured ones, i.e., 

|Tr(u;ptrunc) - Tr(w/9)| < e, (65) 

for all w such that ||w|| < 1. 

We assume that the observed data correspond to a matrix p (not necessarily a state) 
with WVizip ^ p)||2 < (5 where p is the low-rank, infinite-dimensional state, i.e., the 
errors made by truncating to a finite-dimensional Hilbert space are already included in 
S, and where we denote by V-jz the projection to the image of the sampling operator 
Such a tube condition is satisfied with very high probability for realistic error models 
like Gaussian noise |lj[18j. We relax the conditions in (|2]i to 

WVni'T - p)h < 5. (66) 

The solution of the SDP might not be of low rank. Because a low-rank state is needed 
for the construction of the certificate Y, we truncate a* to the q largest eigenvalues 
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(call this a*) and obtain Vt' as above. As r = rankp is in general not known, one has 
to perform the truncation of a* and the subsequent construction of the certificate Y for 
q = 1,2,... until a valid Y, as to be specified below, has been found. If this is not the 
case, the number of measurements was not enough and needs to be increased. 

To provide an error bound, we denote the 2-norm error made by the truncation of 
a* to rank g by e and obtain from the triangle inequality 

\\Vnia;-p)h = \\Vn{<j;-<y*)\\2 + \\VTz{<J* -p)h + WPnip-p)h < e + 2S. (67) 

We calculate a candidate for a certificate as y = TZVt' {Vt'T^'Pt')'^ sgn a* where T' 
is obtained from a*. If F is valid, i.e., WVt'^YW < 1/2, and Vt'V-rVt' > {p/'2)'Pt' 
with p = m/'n?, then the proof of Theorem 7 in Ref. IfTSl yields the robustness result 




p||2< £^+2 (2^ + e). (68) 



By the equivalence of the norms, this also provides a 1-norm bound at the expense of 
an additional factor ^/n. 

Thus, with no further assumption than 2-norm closeness of the observations to 
the state of interest it is possible to obtain a certified reconstruction which is also 
close to the state of interest. In this sense, quantum compressed sensing can achieve 
assumption-free certified quantum state reconstruction in the presence of errors. This 
discussion applies to box errors, where each of the expectation values is assumed to be 
contained in a certain interval. The discussion of other error models will be the subject 
of forthcoming work. 



4 Universal quantum compressed sensing 
4.1 Universal quantum state reconstruction 

The preceding discussion has focused on claims of the following form: 

For every low -rank state p, most choices of the observables wi , . . . , Wm 
can be used to successfully reconstruct p. 

In some situations, however, one can actually prove a much stronger statement, in 
which the order of the quantifiers is reversed: 

Most choices of the observables , . . . , Wm will have the property that, 
for every low-rank state p, the observables wi, . . . , Wm can be used to 
successfully reconstruct p. 

This is known as universal reconstruction; more simply, it says that a fixed set of ob- 
servables wi , . . . , Wm can distinguish among all low -rank states simultaneously. Be- 
sides being of conceptual interest, universal reconstruction also implies stronger error 
bounds for reconstruction from noisy data. 

Formally, we say that our method performs universal compressed sensing if p/„ < 
1/2, where p^^ is the "universal" failure probability. That is, we define p/„ to be the 
probability (with respect to the choice of observables wi, . . . , w,„) that there exists 
a state p (with dimension n and rank r) such that the method fails with probability 
> 1/2 (where this last probability is taken with respect to the random measurement 
outcomes). 
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Definition 4 (Efficient universal quantum compressed sensing). Universal compressed 
sensing (with dimension n and rank r) is regarded as efficient if: the number of mea- 
sured observables satisfies m = 0(nr poly log (n)), and the probability of failure sat- 
isfies pfu < 1/2. 



4.2 Universal reconstruction using any Fourier-type tight frame 

In this section, we show that measurements using a Fourier-type tight frame lead to 
efficient universal quantum compressed sensing. This result can be viewed as a com- 
panion to Theorem [2] Essentially, it says that, by using a slightly larger number of 
measurements (by a polylog(7i) factor), one can construct (with high probability) a sin- 
gle, fixed set of measurements that can reconstruct all possible states of rank r and 
dimension n. In addition, universal reconstruction implies very strong error bounds, in 
the case of noisy data; we will discuss this in the following section. 

Tlieorem 5 (Universal reconstruction). Let [w a) a^s be a tight frame. Letv = 0(polylog(n)), 
and suppose that, for all a G S, < v/n. Then efficient universal compressed 

sensing (for states of rank r and dimension n) is possible. 

This proof of this theorem is a straightforward generalization of fS). First, we define 
the sampling operator to be : (C"^" — > R™, 

Tl 

{A{(j)), = ^{wla), i = l,...,m. (69) 



This is related to the notation used in previous sections by A^A = TZ. (As before, 
the observables w'l, . . . , wj„ are sampled independently from the distribution /i on the 
tight frame, and (A, B) = Tr{A^B) is the Hilbert-Schmidt inner product.) 

A key tool in the proof is the restricted isometry property (RIP) |23|. We say that 
A satisfies the RIP if there exists some constant 6 G [0; 1) such that, for all rank-r 
n-dimensional states a, 

(l-^)lk||2 < \\A{a)\\2 < (l + <5)||a||2. (70) 

In geometric terms, the set of all low-rank states forms an 0(rn)-dimensional manifold 
in (C"^", and A satisfies the RIP if it embeds this manifold into (D™, with constant- 
factor distortion. 

The importance of the RIP stems from the following fact: when A satisfies RIP, 
one can reconstruct any low-rank state p from noiseless data A{p), by solving a trace- 
minimization convex program: 

min||fT||i, subject to ^((t) = ^(p). (71) 



This follows from a standard argument of 112311 . This result can be generalized to the 
case of noisy data; we will discuss this in the following section. 

It now remains to prove that, when the observables Wa are chosen from a Fourier- 
type tight frame (i.e., they satisfy ||wa|p < I'/n), the sampling operator A satisfies 
RIP with high probability. Intuitively, one first shows that, for any particular low-rank 
state cr, and a random choice of measurements w'l,. . . jw'^^, the sampling operator 



A satisfies equation ( 70 1 with high probability. After this comes the main part of the 
argument. Let p/ (cr) denote the probability of failure on a given state cr. One now needs 
to upper-bound the probability of a failure on any one of the states cr. The simplest 
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approach is to assume that the failure events are disjoint, and so the probabilities sum 
up — this is the union bound, and it does not give a useful bound in this case. Instead, 
one uses an "entropy argument" that exploits the fact that failure events are not disjoint: 
failures on nearby states are correlated. 

Formally, the entropy argument is carried out using Gaussian processes and Dud- 
ley's inequality (following EOl 1211 ). and using bounds on covering numbers of the 
trace-norm ball due to 1221 . The proof is essentially the same as in |6J; the original 
proof in |6| handles the case where the Wa form an incoherent orthonormal basis, but 
the same proof goes through unchanged for a Fourier-type tight frame. This shows 
that, if the number of measurements satisfies m > Cvrn log® n (for some constant C), 
then with high probability the sampling operator A satisfies the RIP (for rank r and 
dimension n). 



4.3 Robust reconstruction from noisy data 

More interestingly, RIP implies strong error bounds in the case of noisy data lfT3l . We 
sketch the basic idea here. Suppose one observes y — A{p) + z, where z denotes a 



noise component. Then one can replace (71 1 with other estimators, such as the matrix 
Dantzig selector: 

min||cr||i suchthat \\A\y - A{(j))\\ < \, (72) 

or the matrix Lasso: 

inm\\\A{a)^y\\l+Mi- (73) 

(See Ref. |fT3]| for details about setting the parameters A and /i.) 

When the noise vector z is normally distributed, one can show particularly nice 
error bounds. These hold even for states p that are full-rank (though p must at least have 
decaying eigenvalues, for the bounds to be useful) llT3l (see also |l6l). Suppose that p is 
arbitrary, and one simply assigns some value for r, and measures m = 0(i/rn log® n) 
observables. Then let a* denote the solution returned by either of the above estimators. 
Intuitively, one expects that a* should reconstruct the first r eigenvectors of p. One can 
prove a bound that is consistent with this intuition: the squared 2-norm error ||cr* — pjjj 
will be nearly proportional (up to log factors) to the total variance of the noise acting 
on the first r eigenvectors of p, plus the squared 2-norm of the "tail" of p (consisting 
of its last n — r eigenvectors). 



5 Applications 

We now demonstrate how our theory can be applied to some common experimental 
setups in quantum optics. We show how pointwise measurements of the Wigner func- 
tion, and histograms obtained using homodyne detection, can be expressed as mea- 
surements using tight frames, and generalized tight frames. Furthermore, we propose 
efficient compressed sensing schemes (with Fourier-type tight frames) using these mea- 
surements. 



5.1 Homodyne detection 

The most common way to do quantum state tomography on continuous-variable light 
modes is based on homodyne detection, which is done by combining the light field with 
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a mode in a strong coherent state, called the local oscillator, in an interferometer and 
measuring the difference of the intensities on the two output ports |29l|30l[3T]. This 
amounts to sampling x G R according to the one-dimensional probability distribution 
given by the Radon transform (at angle 6) of the Wigner function, i.e., 

^eix) = J W{xcos6 — psin6,xsm9 + pcoa9)dp. (74) 

The angle 9 is chosen by phase-shifting the mode with respect to the local oscillator. 

For a general quantum state with maximal photon number N, N + 1 equidistant 
choices of 9 G [0, tt) are sufficient and necessary to reconstruct the state by an inverse 



Radon transform of Eq. ( 74 1 or by using pattern functions |29 30). Here we show how 
these measurements can be described by a generalized tight frame. A tight frame by 
itself does not suffice because here every measurement setting, i.e., every choice of 9, 
does not only give a single number as a result but an entire distribution Tg. 

A key observation is that the Fourier transform of the probability distribution (|74]i 
is identical to the characteristic function, i.e., the Fourier transform of the Wigner func- 
tion, written in radial coordinates. We define 



W{u,v) = J dx dpW{x,p) exp[—i{ux + vp)] 



(75) 



which fulfills 

Pe (C) = W^(C cos 61, C sin 9) (76) 

where Pe(C)== / dxPe(C) exp(-iCx). 

This allows us to write the projector (corresponding to measurement setting 9 and 
outcome Q as 

CPe (0) = W^b) {r\ (C cos 0, C sin 9)W;^ (C cos 9, ( sin 9) . (77) 

Because choosing a measurement setting does not mean choosing values for 9 and (, 
but rather only choosing a phase 9 and obtaining a whole "slice" of the characteristic 
function, the operator corresponding to a measurement setting is 

Ve^ J dCVeiO- (78) 

It is easy to check that Vg fulfills 

- / d9Ve = — , (79) 







which implies that it satisfies Definition |2] and forms a generalized tight frame. 

5.2 Efficient compressed sensing using homodyne measurements 

In the previous subsection, we have introduced the generalized tight frame correspond- 
ing to homodyne detection. This can be combined with the convex program in equation 
Q to perform state reconstruction. In Section[6] we show by means of a numerical sim- 
ulation that this procedure performs well in practice. However, our theoretical analysis 
does not apply to this procedure, due to the generalized tight frame; it would be inter- 
esting to try to extend our theoretical results to this case. 
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In this section, we present a different way of using homodyne detection to recon- 
struct low-rank states, which is a little less direct, but does have a rigorous guarantee 
of success. We will do three things. First, we will show how homodyne measurements 
can be used to estimate expectation values of displacement operators. Then, we will 
use (scaled) displacement operators to construct a tight frame. Finally, we will show 
that this tight frame has Fourier-type incoherence. By combining these pieces, we will 
then get an efficient compressed sensing scheme. 

Before continuing, we note that D{a) cannot be directly measured as it is not Her- 
mitian. However, one can also use 8-port homodyning to directly measure the ob- 
servables |a)(Q;| = D{a)\0) {0\D^ {a) ||^8|. Because the experimental effort is higher, 
compared to standard homodyning, we will not discuss this scheme here. 

Define the displacement operators 

= e-l"l'/2e"°*e-"*", a G (D. (80) 

Note that we have the identities D{a) = e«"*-"'° = eH^^^-a'a^aa^ 
Now recall the definition of the characteristic function j 



C(")(/3) =Tr(e*'^'^^'+''3*», /? e C. (81) 

Setting a ~ i/3, we see that C'''^(/3) is precisely the expectation value of the displace- 
ment operator D(a). On the other hand, C'*'(/3) is also equal to W{l3), the (two- 
dimensional) Fourier transform of the Wigner function W{£^). This in turn is related. 



via equation (76 1, to the probability distribution 'Pg{x), which we can sample using 
homodyne detection. 

Thus, we can estimate the expectation value of a displacement operator D{a) as 
follows: set (5 — —ia, and make homodyne measurements with phase angle 6 = 
arg(/3). This produces several points xi, . . . G R sampled from the distribution 
Wg{x). Then set ( = \/3\, and compute j Yfi=i 6xp(— iC^^i)- This gives an estimate 
for Pe(C) — W{13) = C''^^ which is the desired expectation value. 

Note that a lossy detector (i.e., one with efficiency less than 1) has the effect of 
convolving the true Wigner function W{£,) with a Gaussian, to produce the empiri- 
cally observed Wigner function ll33l . This is equivalent to pointwise multiplying the 
characteristic function C'''''(/3) with a Gaussian envelope. We can compensate for this 
by re-scaling C^'^^P) at each point (3, provided that our raw estimates of C''*^(/3) are 
sufficiently precise, and the detector efficiency is not too poor. 

Next, we will construct a tight frame using the displacement operators D{a). Note 
that the D{a) form an orthonormal basis for the state space f32l : 

p=-l D{a)Tr{D{a)^ p)da, for all states p, (82) 

where we are taking a 2-dimensional integral over the complex plane. Now suppose 
we sample a from a 2-dimensional Gaussian distribution on the complex plane with 
width (T (which we will choose later). This distribution has probability density 

^G(a) = 5^e-H■^/^'^^ (83) 
Define scaled displacement operators 

D{a) = y2crel"l'/'''"' (84) 
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Then we can rewrite ( (82] l as 



p = / D{a)TT{D{a)^ p)PGia)da, for all states p. (85) 
J<c 

This is (up to normalization) a tight frame for the full, infinite-dimensional state space. 

In fact, we are only interested in the finite-dimensional subspace consisting of states 
with at most photons; this subspace is isomorphic to So we will 

truncate the above operators. Let IlAr be the projector onto span{|0), |1), . . . , 
(where the \ j) are Fock basis states). Then define truncated displacement operators 

DN{a) ^nND{a)nN, and Z)Ar(a) = nAri)(a)nAr. (86) 

Then the operators — ]vqr[^w(a) form a tight frame for as de- 

sired: 



{N+l] 



iP = 



f Wc,TT{wip)PGia)da, for all p G C^^+^^^^^+i). (87) 



Finally, we set a = \/'2N log(l + 'iN), and we claim that the above tight frame 
{wa} is Fourier-type incoherent, in the sense of Theorems |2] and |5] More precisely, we 
claim that 

\\DN{a)\\ < V2ea = 2e log(l + 4iV), for all a eC; (88) 
we will prove this below. This directly implies 



\wa\\ < ^ for all a eC. (89) 



Then, by Theorems |2]andl5 
We now show why ([88 



we have an efficient compressed sensing scheme. 
I holds. First, note that while the displacement operators 
D{a) are unitary, the scaled operators D{a) are unbounded. However, when a is 
small, this is not a problem. In particular, when \a\ < 2a, we can just use the trivial 
bound 

\\DNia)\\ < \\Dia)\\ < N/2ael"l'/4^' , (90) 



which implies (88 1. 

It remains to consider the case where \a\ > 2a. In this case, D{a) is large, but 
it acts mostly on states with more than n photons, so the truncated operator I?7v(a) is 
small. Using a straightforward calculation, we can bound DN{a) in the 2-norm, which 
implies (88 1. See the appendix for details. 



5.3 Pointwise measurements of the Wigner function 

A quantum state p of a single optical mode can be represented in phase space by a real 
Wigner function Wp : IR f2W\. For a single mode it is given by, c.f. Ref. | 



(91) 



where (—1)" is the parity operator where £ = {x,p) G R^, D{£^) is the displacement 
operator which becomes the one defined in ( 80 1 by setting a — {1/ \/2) (Ci + ^^2 ) ■ With 
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the same convention, we will, whenever it is convenient, regard the Wigner function as 
a function of a complex variable. 



Eq. (91 1 allows pointwise measurement of the Wigner function by a displacement 
in phase space followed by a measurement of the parity of the photon number. This has 
already been experimentally performed for optical fields in a cavity Ii24i and for pulsed 
single photons (for the special case of a rotationally invariant state) 1251 . We consider 
a single mode containing up to N photons and, therefore, Hilbert space dimension 
n = N + 1. Measuring the Wigner function at a point a amounts to a measurement of 
the observable 

= V2TT'^D{a){~lfD^{a). (92) 

TT 



We make again use of the probability density Pq of Eq. (83 i and define scaled, trun- 
cated operators Wa = n^^PG{a)^^^^IlNUJa^N- They form a tight-frame on the 
truncated Hilbert space when the sampling is performed according to Pq. 

We now proceed exactly as in the previous section to show that the operator norm 
of the w is small enough for the Fourier type incoherence property of Theorem |2] We 
do not give explicit constants but focus on the asymptotic scaling in n. If |a| < 2a, 
we get the bound ||wq|| < 4ecr/n. We will show that if we set a = y^logn one has 
II exp(|ap/(2cr2))ii;„|| < 1 for all a with |q:| > 2a whenever n is large enough which 
implies the requirements of Theorem |2] We need the matrix elements {l\wa\k) = 
W\k){i\{a)- To calculate them, first let ^ ~ {x,p) and recall the definition of the 
Wigner function Ii28l : 



Wii)(^k\ix,p) = IJ dyib:ix + y)Mx-y)e^''"'- (93) 

where we remember the identification a = {l/V2){x + ip). Inserting the eigen- 
functions of the harmonic oscillator tpi, using the properties of the occurring Hermite 
polynomials, and performing the integral allows to write 



(94) 



with the generating function 

G{x,x',p) = e-p'+2v(:r-^')-2x:r'^ (95) 

From this, one gets the bound, which is by no means tight but strong enough, | VK|fe)(/| {a)\ < 
rt"(2|a|)" exp(— 2|a|2) which allows us to write 

II exp(|a|V(2a2))u;„|| <|| exp(|a| V(2^72))u,„||2 



<exp -2|ar + (n + 2)logn + 2nlog(2|a|) 



lap 



2a 



(96) 



We now set a = -y/n log n and get, for large enough n, a bound valid for all a with 
I a I > 2a which reads 

II exp(|a|V(2cr^))u;a|| < exp {-2nlog^ n + {n + 2) \ogn + 2nlog{4:^/nlogn)) . 

(97) 

As the first term in the exponent grows fastest, one has || exp(|a|2/(2cr2))ii;Q|| < 1 
for sufficiently large n. Thus, there is some C > such that ||wq|| < C \ogn/ y/n 
which means that the pointwise Wigner function measurement is of Fourier type and, 
therefore, can be used for efficient compressed sensing. 
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Figure 1 : (color online) Reconstruction of a random pure state on 4 qubits by Pauli- 
measurements. Red triangles: Probability of successful state recovery. Blue circles: 
Probability of successful certification. 



6 Numerical examples 

We now present some examples which show the performance of certified compressed 
sensing for randomly chosen states. We demonstrate the method for small-dimensional 
noiseless states and defer a detailed analysis of the method, especially in the presence 
of noise and decoherence, to a subsequent publication. For small systems, the condition 
C3 < 1 in Theorem[T|is hard to satisfy. However, the conditions for uniqueness can be 
replaced by (a') Ci := sgnp|j2 — Oand(b')c2 '■— li'Pj^yH < 1, discarding the 



condition on C3, because these conditions imply that the expression in ( 1 1 1 is positive, 
which guarantees that any feasible change in the solution will be 1-norm increasing. 

Figure [T] demonstrates certified compressed sensing for the very important case of 
the Pauli basis. It is clearly visible that the certificate is only a sufficient condition and 
not a necessary one as it is possible that the reconstruction is successful but no valid 
certificate is produced. It is also apparent that the overhead in the number of queries 
needed for certification is actually quite reasonable. 

For the tight frame consisting of all Hermitian matrices, as shown in Figure|2] it is 
interesting to note that taking global random observables performs superior to taking 
tensor products of local random observables. The intuitive reason for this is provided 
by concentration of measure. By considering a distribution of observables which is 
invariant under the action of the unitary group on the full system, the proportion of 
observables that are not Fourier-like, i.e., whose operator norms are too large, is much 
smaller Thus, more information is obtained per observable which leads to a faster 
reconstruction. 

Figure |3] illustrates that compressed sensing also works using optical homodyne 
detection with a generalized tight frame, c.f Subsection 5.1 In Figure|4] we show the 



reconstruction of a single mode optical state based on the measurement of expectation 



values of displacement operators as discussed in Subsection 5.2 
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Figure 2: (color online) Reconstruction of a pure state on 4 qubits. Red triangles 
(blue circles): Probability of successful state recovery (certification) for local random 
measurements. Green squares (black crosses): Successful state recovery (certification) 
for global random measurements. 
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Figure 3: (color online) Reconstruction of a random state with rank 5 on 3 modes 
with up to 2 photons each by optical Homodyne detection. Red triangles: Probability 
of successful state recovery. Blue circles: Probability of successful certification. 
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Figure 4: (color online) Reconstruction of a random state of a single optical mode, 
truncated at the 15-th number state, by measuring expectation values of 2-norm normal- 
ized displacement operators D{a) where |q:|| is chosen uniformly at random between 
and 5 while arg(Q!) is chosen uniformly between and 2tt. Red triangles: Probability 
of successful state recovery. Blue circles: Probability of successful certification. 

7 Summary 

In this article, we have presented a general theory of quantum state tomography for 
continuous-variable systems using compressed sensing. We have used tight frames 
to describe continuous measurement families, which are very natural in a plethora of 
physical situations. We have shown how our theory applies to prominent and fre- 
quently used techniques in quantum optics, in particular, pointwise measurements of 
the Wigner function, and homodyne detection. 

• We have explored different incoherence properties sufficient for efficient com- 
pressed sensing. Improved results using Fourier-type tight frames were presented 
in Theorem|2] Also, it was shown in Theorem|3]that for every tight frame whose 
operators fulfill 

=0(polylog(n)), (98) 

most states (i.e., all but a proportion l/poly(n) thereof) can be reconstructed 
from merely 0(npolylog(n)) expectation values. It would be interesting to 
extend these results to generalized tight frames. 

• We have introduced the idea of certified compressed sensing which allows to get 
rid of all assumptions and guarantee successful state reconstruction a posteriori. 
This assumption-free certified quantum state reconstruction is possible even in 
the presence of errors. 

• Furthermore, we have shown universal compressed sensing results for any Fourier- 
type tight frame in Theorem |5] This implies strong error bounds in the case of 
noisy data. 

• We have presented numerical results showing the practical (non-asymptotic) per- 
formance of these methods. It would be interesting to investigate this further, in 
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particular to other types of feasible measurements, and to apply these ideas on 
other physical systems as well. 
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Appendix 

Properties of the x^-distribution 

In order to be self-contained, we repeat two simple bounds to the tails of a dis- 
tributed random variable X which can be found in Ref. ITSl . A right-sided bound 



IS 




(99) 



while a left-sided one is 




(100) 
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Random vectors on a sphere 

A random vector v E C" on a sphere can be obtained by choosing an vector v G M^" 
with Gaussian entries and normaHzing. Doing so yields 



To bound the first term, one can use ( 99 1, obtaining 



P I \v,\ > < cxp f-^ I (102) 



£\/n 



while for the second terms the inequality ( 100 1 leads to 

P(|l«f < <cxp(^-^j . (103) 

Setting e = 1/2 finally gives 

P (1^,1 > 6/Vn) < 2cxp (^-^-Pj . (104) 

Proof of Lemma M 

Proof: From 

V(P{x'^y\xeX)> /3\yEY)<^ (105) 

it foUows that 

IPix '/^ y\x e X,y eY) < p. (106) 
We assume now the contrary of ( |105| l, i.e., 

P(P(x7^2/|a;eX) >/3|yey) > ^ (107) 

from which follows 

lPix^y\xeX,yeY)>p, (108) 
which is a contradiction to ( |106| l and, therefore, concludes the proof. 

Truncating the Hilbert space of a continuous-variable-Ught mode 

We show how large the Hilbert space must be to describe a continuous-variable-light 
mode with bounded energy, i.e., bounded photon number Let p be the state of interest, 
^mcan its mean photon number, and ptrunc the truncation of p to the first N Fock layers 
which is not normalized 

oo oo 
-^mcaii — ^ ^ ^ Pn.n 

>iN + l) 

> (iV+l)Tr(ptrunc-p). (109) 



From this we obtain 



Tr(p-pt™nc)<^^. (110) 



To get from ( 1 10 1 an error to the 1-norm we need 
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Lemma 9 (Truncation of matrices). Let M be a positive semidefinite matrix, or a 
trace-class operator, written as 

where A and C are square matrices. It is true that 

\\B\\l<\\A\\4C\\i. (112) 



Inserting ( 1 12 1 with M = p into \\\Qi) and employing the triangle inequality yields 
with ||A||i < 1. 



llptrunc - Hll < ^ + < 3^^, (113) 



as long as N + 1> Nn 



Proof of Lemma |9] 

We decompose the Hilbert space according to the block structure of ( |1 1 \\ as EQ)F and 
write M as M = J^k ^^^k where the Mk are rank one projectors with Ak, Bk, and Ck 
as in (pTT]) and A > 0. Now, we write AA4 = |*fc)(*fc| with \^k) = a/cl^fe) + ^felV^fe) 
where 10^) e E and \ipk) G P- From this, one obtains immediately 

\\Bk\\l = lafcHbfep = \\Ak\\i\\Ck\\i. (114) 

To conclude the proof, we write 

iiBiii <Eii^fciii ^E^^^v^^ 

k k 

where we have used the Cauchy-Schwarz inequality. 



Proof of equation (88 1 



It remains to consider the case where \a\ > 2cr. We start by bounding the matrix 
elements of the displacement operator D{a): 



{k\D{a)\tj = e-l"l'/2(fc|e"'^^-«*«|£), 



(116) 



i=0 
k 



4=0 



(117) 



=E7fv/fc---(fc-.7 + l)(fc-j|=E(SVfc---(.7 + l)(jl, (118) 
3=0 i=o 



min(/c,^) 

(fc|i?(a)K)=e-H^/2 ^ (C;)/"(S)!V fc^^^O^ + l)v/^^^(J• + l)■ (ll9) 

3=0 
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Using the Cauchy-Schwarz inequality, and the binomial theorem, 



inin(fc,£) 



1/2 



j=0 



min(fc,£) 



--'"'"'1 E [ E 



J=0 



1/2 



-j) 



j=0 



j=o 



1/2 



= e-HV2(l+|^|2)fe/2(^^|^|2^£/2_ 



(120) 



Note that, for any fixed k and £, this quantity decays exponentially as |a| becomes 
large. 

We now consider the iV -photon truncated operator D^ia). We can bound it in 
2-norm as follows: 



||i?iv(a)||2 < e-HV2 [ 5: (1 + |a| Y(l + Ic^n^ 

k,i=0 



= e-|«lV2 



2-jW+l 
(1+1^ 

< e-l«l'/2(l + |a|2)^+i|a|-2 = e-HV2(i + |a|2)^(i + |«|-2). 

Then we can bound the scaled tnmcated operator -DAr(a) as follows: 
||^iv(a)||2 < V2acxp(g - M!)(i + |«|2)^^(i + |„|-2) 



V2aexp[^ - M! + iVlog(l + \a\^)]{l + \a\-^). 



(121) 



(122) 



Let 



(123) 



E:=^-^-^-^+Nlog{l + \a\') 

be the quantity inside the exponential; we will upper-bound it. Note the following 
identity, for any x,xo G (0, oo): (by approximating log(l -|- a;) to first order at the 
point X = Xq) 

log(l + a;) < log(l + xo) + ^= log(l + xq) + ^ - 1. (124) 
Set a; = |a|^ and xq = 4:N, then we have 

log(l+|an < log(l+4iV)+i^-l < log(l+47V)+i±^-l < log(l+47V)+^ 



Then 



E<{^-^ + l)\a\^+n\ogil+4N). 
Using the fact that a>2a= ^y8N\og{l + 4iV), we get 

E<{^-l + l + l)\a\' = {-^, + ^)\a\^ 



4N ■ 

(125) 



(126) 



(127) 
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Plugging this into ( 122\ , we get 

\\DN{a)\\2 < V2aeMi-l + 4^)l«l'](l + l«r')- 
Using the fact that <t > 2 and |q;| > 2(t > 4, we have that 



(128) 



a 



< %/2crexp(-l)i^ < V2cr. (129) 



Since the operator norm is upper-bounded by the 2-norm, this implies (88 i. 
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